Zero-Variance Importance Sampling Estimators for Markov Process Expectations
Authors
Abstract
We study the structure of zero-variance importance sampling estimators for expectations of functionals of Markov processes. For a class of expectations that can be characterized as solutions to linear systems, we show that a zero-variance estimator can be constructed by using an importance distribution that preserves the Markovian nature of the underlying process. This suggests that good practical importance sampling distributions can be found by searching within the class of Markovian probability distributions. The class of expectations considered includes, among other particular cases: the mean time until hitting a rare set, the expected cumulative discounted reward until hitting a set, the mean duration of an excursion, the transient (deterministic-horizon) expectation of a discounted final payoff, and steady-state expectations.
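As a concrete illustration of the Markovian change of measure described above, the following sketch (not taken from the paper) treats a standard textbook instance of an expectation that solves a linear system: the probability that a small birth-death chain hits a rare state before an absorbing one. The chain, its parameters, and the helper names are illustrative assumptions; once the value function h is computed exactly, every simulated path under the tilted (still Markovian) kernel returns the same estimate.

```python
# Hypothetical sketch (not from the paper): zero-variance importance sampling
# for the probability of hitting a rare state before an absorbing one, using a
# Markovian change of measure built from the value function h.
import numpy as np

rng = np.random.default_rng(0)

# Gambler's-ruin chain on {0,...,n}; states 0 and n are absorbing, {n} is rare.
n, p = 10, 0.3                       # up-probability p < 1/2 makes {n} rare
P = np.zeros((n + 1, n + 1))
P[0, 0] = P[n, n] = 1.0
for x in range(1, n):
    P[x, x + 1] = p
    P[x, x - 1] = 1.0 - p

# Value function h(x) = P_x(hit n before 0): solution of the linear system
# h(x) = sum_y P(x,y) h(y) on transient states, with h(0) = 0 and h(n) = 1.
trans = np.arange(1, n)
M = np.eye(n - 1) - P[np.ix_(trans, trans)]
b = P[trans, n]                      # one-step probability of jumping to n
h = np.zeros(n + 1)
h[n] = 1.0
h[trans] = np.linalg.solve(M, b)

def zero_variance_path(x0):
    """Simulate one path under the tilted Markov kernel
    Q(x,y) = P(x,y) h(y) / h(x) and return the importance sampling estimate."""
    x, weight = x0, 1.0
    while x not in (0, n):
        q = P[x] * h / h[x]          # zero-variance transition probabilities
        y = rng.choice(n + 1, p=q)
        weight *= P[x, y] / q[y]     # accumulate the likelihood ratio
        x = y
    return weight * (x == n)         # indicator of hitting the rare state

x0 = 5
print("exact h(x0):", h[x0])
print("IS estimates:", [zero_variance_path(x0) for _ in range(3)])  # all equal h(x0)
```

The likelihood ratio telescopes to h(x0)/h(x_T) = h(x0) on every path, which is exactly the zero-variance property the abstract refers to.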
Related works
Variance reduction of estimators arising from Metropolis-Hastings algorithms
The Metropolis–Hastings algorithm is one of the most basic and well-studied Markov chain Monte Carlo methods. It generates a Markov chain whose limiting distribution is the target distribution by simulating observations from a different proposal distribution. A proposed value is accepted with a particular probability; otherwise the previous value is repeated. As a consequence, the accepted v...
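The following is a minimal random-walk Metropolis-Hastings sketch of the accept/repeat mechanism described in this snippet; the bimodal target, proposal scale, and function names are illustrative assumptions, not taken from the cited paper.

```python
# Hypothetical random-walk Metropolis-Hastings sketch targeting a bimodal density.
import numpy as np

rng = np.random.default_rng(1)

def log_target(x):
    # Unnormalized bimodal target: equal-weight mixture of two Gaussians.
    return np.logaddexp(-0.5 * (x - 3.0) ** 2, -0.5 * (x + 3.0) ** 2)

def metropolis_hastings(n_samples, x0=0.0, step=1.0):
    x, chain = x0, []
    for _ in range(n_samples):
        proposal = x + step * rng.normal()        # symmetric proposal
        log_alpha = log_target(proposal) - log_target(x)
        if np.log(rng.uniform()) < log_alpha:     # accept with prob min(1, ratio)
            x = proposal                          # accept the proposed value
        chain.append(x)                           # otherwise repeat the previous value
    return np.array(chain)

samples = metropolis_hastings(50_000)
print("estimated mean under the target:", samples.mean())
```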
Importance Sampling in Path Space for Diffusion Processes
Importance sampling is a widely used technique to reduce the variance of the Monte Carlo method. It uses the idea of change of measure to design efficient Monte Carlo estimators. In this work, we study the importance sampling method in the framework of diffusion processes and consider changes of measure that can be realized by adding a control force to the original dynamics. For certain expo...
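A hypothetical discretized sketch of the idea in this snippet: a control force is added to the drift of the dynamics, and a Girsanov-type likelihood ratio corrects for the change of measure. The Ornstein-Uhlenbeck drift, the constant control, and the rare-event functional are illustrative choices, not the setting of the cited paper.

```python
# Hypothetical path-space importance sampling sketch for a diffusion:
# simulate under a controlled drift and reweight with a discrete Girsanov factor.
import numpy as np

rng = np.random.default_rng(2)

dt, n_steps, sigma = 0.01, 100, 1.0
a = 3.0                                   # rare threshold for X_T under the original dynamics

def drift(x):
    return -x                             # Ornstein-Uhlenbeck pull toward 0

def control(x):
    return 2.0                            # constant force pushing paths upward (a guess, not optimal)

def one_path():
    x, log_weight = 0.0, 0.0
    for _ in range(n_steps):
        u = control(x)
        dw = rng.normal(scale=np.sqrt(dt))               # Brownian increment under the new measure
        x += (drift(x) + sigma * u) * dt + sigma * dw    # controlled dynamics
        log_weight += -u * dw - 0.5 * u * u * dt         # discrete Girsanov weight dP/dQ
    return (x > a) * np.exp(log_weight)

estimates = np.array([one_path() for _ in range(20_000)])
print("IS estimate of P(X_T > a):", estimates.mean())
```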
Estimating standard errors for importance sampling estimators with multiple Markov chains
The naive importance sampling estimator based on samples from a single importance density can be extremely numerically unstable. We consider multiple-distribution importance sampling estimators, where samples from more than one probability distribution are combined to consistently estimate means with respect to given target distributions. These generalized importance sampling estimators pr...
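A small sketch of one common way to combine samples from several proposal densities, using mixture (balance-heuristic) weights; the Gaussian target, proposals, and sample sizes are illustrative, and this is not necessarily the specific estimator studied in the cited paper.

```python
# Hypothetical multiple-distribution importance sampling sketch:
# samples from two Gaussian proposals are combined with mixture weights.
import numpy as np

rng = np.random.default_rng(3)

def norm_pdf(x, mean, sd):
    return np.exp(-0.5 * ((x - mean) / sd) ** 2) / (sd * np.sqrt(2.0 * np.pi))

target = (0.0, 1.0)                              # target pi: N(0, 1)
proposals = [(-2.0, 1.5), (2.0, 1.5)]            # two proposal densities
n_per = 5_000                                    # samples drawn from each proposal
n_total = n_per * len(proposals)

def mis_estimate(f):
    values = []
    for mean, sd in proposals:
        x = rng.normal(mean, sd, size=n_per)
        # Mixture density of the combined sampling scheme (balance heuristic).
        mix = sum(n_per / n_total * norm_pdf(x, m, s) for m, s in proposals)
        values.append(f(x) * norm_pdf(x, *target) / mix)
    return np.concatenate(values).mean()

print("E[X^2] under the target:", mis_estimate(lambda x: x ** 2))   # ~ 1.0
```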
Importance Tempering
Simulated tempering (ST) is an established Markov chain Monte Carlo (MCMC) methodology for sampling from a multimodal density π(θ). The technique involves introducing an auxiliary variable k taking values in a finite subset of [0, 1] and indexing a set of tempered distributions, say π_k(θ) ∝ π(θ)^k. Small values of k encourage better mixing, but samples from π are only obtained when the joint ...
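A hedged sketch of the reweighting idea behind importance tempering: sample a tempered density π_k(θ) ∝ π(θ)^k with k < 1 (which mixes across modes more easily), then importance-reweight by π(θ)^(1-k) to recover expectations under π. The target, the value of k, and the sampler settings are illustrative, and the full simulated-tempering joint update over (θ, k) is omitted.

```python
# Hypothetical importance-tempering sketch: sample pi_k with k < 1,
# then self-normalized importance reweighting back to pi.
import numpy as np

rng = np.random.default_rng(4)

def log_pi(theta):
    # Unnormalized bimodal target with well-separated modes.
    return np.logaddexp(-0.5 * (theta - 4.0) ** 2, -0.5 * (theta + 4.0) ** 2)

def mh_tempered(k, n_samples, step=2.0):
    """Random-walk Metropolis targeting the tempered density pi_k ∝ pi^k."""
    theta, chain = 0.0, []
    for _ in range(n_samples):
        prop = theta + step * rng.normal()
        if np.log(rng.uniform()) < k * (log_pi(prop) - log_pi(theta)):
            theta = prop
        chain.append(theta)
    return np.array(chain)

k = 0.3                                           # small k: better mixing across modes
samples = mh_tempered(k, 50_000)
log_w = (1.0 - k) * log_pi(samples)               # importance weights proportional to pi^(1-k)
w = np.exp(log_w - log_w.max())                   # stabilize before normalizing
w /= w.sum()
print("E[theta^2] under pi:", np.sum(w * samples ** 2))
```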
Journal: Math. Oper. Res.
Volume: 38, Issue: -
Pages: -
Year of publication: 2013